Musical and Phonetic Controls in a Singing Voice Synthesizer

Author

  • Luis Vergara
Abstract

… have directed the project in Hamamatsu (Japan) and have visited us regularly in Barcelona. The rest of the people in the Music Technology Group also deserve mention; they have always been available to give us a hand when needed. In addition, I want to express my gratitude to Luis Vergara, who has directed this thesis from the Polytechnic University of Valencia.

… Barcelona) in collaboration with Yamaha Corporation. The Music Technology Group (MTG) is a research group working on signal processing techniques for musical production and other multimedia applications. Apart from pursuing the development of spectral audio models, MTG is dedicated to sound models for synthesis, the processing of audio-based content and other issues related to music technology. Yamaha Corporation, for its part, manufactures all kinds of musical instruments and professional audio equipment, for professionals and amateur enthusiasts alike. From its base in Hamamatsu City, southwest of Tokyo, the company is also a leading producer of audiovisual products, semiconductors and other computer-related products, electronic equipment and specialty metals.

The aim of the Daisy project is to synthesize a singing voice from a musical score. That is to say, from a given melody and given lyrics in a particular language (English and Japanese in our case), our goal is to obtain an output sound as though a real singer were performing the song. This is not an easy task, to say the least. However, with the accumulated knowledge in different fields, the use of new technologies and the increasing power of computers, this objective has now become achievable. The artistic and technical disciplines relevant to this project cover a wide variety of fields: sound recording and reproduction, music performance, music perception, phonetics, computer programming and digital signal processing, among others. It is a truly multidisciplinary enterprise.
The research project presented here is a continuation of an automatic singing voice impersonator application for karaoke developed by the Music Technology Group [Cano, Loscos, Bonada, de Boer, Serra, 2000]. That system morphed, in real time, the voice attributes of a user (such as pitch, timbre, vibrato and articulations) with those of a prerecorded singer.

Because of my education as a musician and as an engineer, I have always wanted to work in an area in which I could apply my knowledge of both fields. And from the first time I heard about the …

Similar resources

Performance-driven Control for Sample-based Singing Voice Synthesis

In this paper we address the expressive control of singing voice synthesis. Singing Voice Synthesizers (SVS) traditionally require two types of inputs: a musical score and lyrics. The musical expression is then typically either generated automatically by applying a model of a certain type of expression to a high-level musical score, or achieved by manually editing low-level synthesizer paramete...

Improvements to a Sample-Concatenation Based Singing Voice Synthesizer

This paper describes recent improvements to our singing voice synthesizer based on concatenation and transformation of audio samples using spectral models. Improvements include, firstly, robust automation of the previous singer database creation process, a lengthy and tedious task which involved recording script generation, studio sessions, audio editing, spectral analysis, and phonetic based segmen...

Mandarin Singing Voice Synthesis Based on Harmonic Plus Noise Model and Singing Expression Analysis

The purpose of this study is to investigate how humans interpret musical scores expressively, and then design machines that sing like humans. We consider six factors that have a strong influence on the expression of human singing. The factors are related to the acoustic, phonetic, and musical features of a real singing signal. Given real singing voices recorded following the MIDI scores and lyr...

Real-time CALM Synthesizer: New Approaches in Hands-Controlled Voice Synthesis

In this paper, a new voice source model for real-time gesture–controlled voice synthesis is described. The synthesizer is based on a causal-anticausal model of the voice source, a new approach giving accurate control of voice source dimensions like tenseness and effort. Aperiodic components are also considered, resulting in an elaborate model suitable not only for lyrical singing but also for v...

Sample-based singing voice synthesizer by spectral concatenation

The singing synthesis system we present generates a performance of an artificial singer out of the musical score and the phonetic transcription of a song using a frame-based frequency domain technique. This performance mimics the real singing of a singer that has been previously recorded, analyzed and stored in a database. To synthesize such a performance the system concatenates a set of element...

Synthesis and Processing of the Singing Voice

As early as the beginning of the 60's, the singing voice has been synthesized by computer. Since these first experiments, the musical and natural quality of singing voice synthesis has largely improved and high quality commercial applications can be envisioned in the near future. This talk gives an overview of synthesis methods, control strategies and research in this field. Future challenges in...

Publication date: 2001